
394 WE CLAIM: 



395 1. (canceled) rewritten/re-presented in claim 5 

396 2. (canceled) rewritten/re-presented in claim 6 

397 3. (canceled) rewritten/re-presented in claim 7 

398 4. (canceled) rewritten/re-presented in claim 8 

399 5. (canceled) rewritten/re-presented in claim 9 

400 6. (canceled) rewritten/re-presented in claim 10 

401 7. (canceled) rewritten/re-presented in claim 11 

402 8. (canceled) rewritten/re-presented in claim 12 

403 9. (re-presented - formerly independent claim 5) A method of recognizing an observed 

404 subsequence as being generated by one of a set of Hidden Markov Models (HMM), 

405 characterized by: 

406 • the fact that it searches the subsequence, Q, that offer the minimization of an 

407 inverse confidence measure, over all possible matchings, 

408 • where the inverse confidence measure is one of 

409 1) the accumulated posterior, normalized with the length of the matched sub- 

410 sequence X% (aka. 'simple normalization 5 ) 

411 j^j^log P(Q\Xi) 
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2) partitioning the states in a HMM into phonems, having a function 
Phonemes(Q) that returns the segmentation of a path Q in the HMM into 
phonems, and computing one of: 

2a) the worst average match in a phoneme, called 'real fitting 5 , 



2b) double normalization of the accumulated posterior over the number of 



ej is the time frame where Q enters phoneme j, and bj is the exit time 
frame from each phoneme, j, 



and allows for the optional revaluation of the alternatives that offer the high- 
est scores of a mentioned confidence measure on the basis of another confidence 
measure, 

and when based on the confidence measure called 'simple normalization' uses a 
method that applies Viterbi decoding for a HMM obtained by extending the initial 
one with a filler state just after start and one just before the termination state, 
and estimates the emission probability of the filler states in an iterative manner 
as being equal to the inverse confidence measure in the previous iteration, 
and where the emission probability in the filler states in the first iteration can be 
initialized to any floating point number, but the iteration stops: 



argmin( 
Q 



max 

Q€Phonemes(Q) 



|{*k*€Q}| 



phonemes, J, and over the number of acoustic samples, ej - bj + 1, where 
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432 i at convergence yielding the estimation of a keyword's boundaries and score 

433 as the obtained boundaries and score of non-filler states of the HMM, 

434 ii when the confidence measure descends under a threshold value, T, estimating 

435 only the keyword existence, 

435 iii when the emission probability of filler states, e 0 is initialized with T and is 

437 reestimated, as value of e x at the end of the first iteration, to be higher than 

438 T deciding keyword inexistence, 

439 • or for any of the three confidence measures: 'simple normalization', 'double nor- 

440 malization' or 'real fitting', uses a beam-search-like algorithm that considers the 

441 emission probability of the filler state as zero, computes progressively for each 

442 pair of sample and state of HMM a set of possible alternatives paths to reach it, 

443 the computation of this set is based on the sets of paths that lead to the states that 

444 can be associated to the previous sample and extended with transitions allowed 

445 by the analyzed HMM, 

446 where this set can be reduced by using appropriate (safe) rules for the given 

447 confidence measure, ensuring the correctness of the inference, 

448 and where this set can be also reduced by using heuristics, for speeding up the 

449 computation despite the risk of reducing the theoretical quality of the recognition, 

450 heuristics of which a fast version stores only the best match, 

451 and for all confidence measures one can prune the set of alternatives with safe rules 

452 guaranteeing optimality, where: 
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the 'simple normalization' confidence measure with beam-search is used with a 
safe pruning that discards a path Q 1 given the existence of a path Q 2 whenever 
S 2 < Si and L x < L2, where S x and L x respectively S 2 and L 2 are the minus of 
the cumulated log of posteriors along the paths, and the lengths of the paths, for 
the paths Q x respectively Q 2 , and which can be optimized by sorting competing 
paths based on their cost 

the 'double normalization' confidence measure on HMMs where no path skips any 
phoneme is used with a safe pruning that discards a path Q x given the existence 
of a path Q 2 whenever one of the following tests succeed: 

(a) l 2 >l u A > 0, B < 0 and L\A + L C B + C > 0 

(b) l 2 >l u A > 0, B > 0 and C > 0 

(c) l 2 >l u A < 0, C > 0 and L 2 A+ LB + C>Q 

(d) l 2 >l u A = Q, B < 0 and LB + C > 0 

where we denote by a u p 1} Z X) respectively by a 2 , p 2 and l 2 the confidence measure 
for the previously visited phonemes, the posterior in the current phoneme and 
the length in the current phoneme for the path Q x , respectively the path Q 2i 
and we also use the notations A = a x - a 2 , B — {a x - a 2 )(l x + l 2 ) + p x - p 2) 
C=(ai - a 2 )l x l 2 +pil 2 - p 2 l X) L = L max - max{/ 1; / 2 }, L c = -B/2A and L max is 
the maximum acceptable length for a phoneme, 

the 'double normalization 5 confidence measure on HMMs where some paths skip 

phonemes is used with a safe pruning that discards a path Q x given the existence 

of a path Q 2 whenever l 2 > l Xl A > 0, p x > p 2 respectively F 2 >F U 

24 



where Fx respectively F 2 are the number of visited phonemes for paths Q\ and 

• the 'real fitting' is used with the safe pruning: Q 2 is discarded in favor of another 
path Qi if the confidence measure of the Real Fitting for the previous phonemes 
is inferior (higher in value) for Q 2 compared with Qi, and if pi < p 2 and l 2 <h, 
where pi, /1, respectively p 2; k represent the minus of the logarithm of the cumu- 
lated posterior respectively the number of frames in the current phoneme for the 
path Qi respectively Q 2y 

• and besides the previously mentioned safe pruning, heuristic prunings are also 
used for removing paths when p > L max P max in any state or when 2 > P max at 
the output from a phoneme, 

where p and 1 are the values in the current phoneme for the minus of the logarithm 
of cumulated posterior and for the length of the path that is discarded. 

(re-presented - formerly dependent claim 6) The method of claim 9, where the method 
is used to estimate the existence of keywords and their position in utterances, using 
Hidden Markov Models that model keywords. 

(re-presented - formerly dependent claim 7) The method of claim 9, where the method 
is used to estimate the existence of biomolecular subsequences and their position in the 
chains of DNA using hidden Markov models to model the searched subsequences, and 
where these models can be obtained by trivial translation from generalized profiles. 

(re-presented - formerly dependent claim 8) The method of claim 9, where it carries out 
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496 the estimation of the existence of objects and their position in images, characterized 

497 by the fact that 

498 • it uses models of objects as subsequences represented by Hidden Markov Models, 

499 • namely sections through views of objects are modeled by Hidden Markov Models, 

500 • it uses emission probabilities based on a distance computed between colors, sim- 

501 pie distances being yield by a Gaussian with median at the target color, or a 

502 normalized inverse of the Euclidean distance in the RGB space, 

503 • wherein the Hidden Markov Models that model the objects can be structured of 

504 distinct regions, that play in the frame of the method the role of the phonemes 

505 in claim 9, 

506 • and wherein the models of the objects can be modified in a dynamic manner during 

507 decoding with respect to the transition properties (existence and probability) on 

508 the basis of the so far accumulated information in the process. 
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